智能论文笔记

Modeling Motivational Interviewing Strategies On An Online Peer-to-Peer Counseling Platform

Raj Sanjay Shah , Faye Holt , Shirley Anugrah Hayati , Aastha Agarwal , Yi-Chia Wang , Robert E. Kraut , Diyi Yang

分类：人工智能

2022-11-09

Millions of people participate in online peer-to-peer support sessions, yet there has been little prior research on systematic psychology-based evaluations of fine-grained peer-counselor behavior in relation to client satisfaction. This paper seeks to bridge this gap by mapping peer-counselor chat-messages to motivational interviewing (MI) techniques. We annotate 14,797 utterances from 734 chat conversations using 17 MI techniques and introduce four new interviewing codes such as chit-chat and inappropriate to account for the unique conversational patterns observed on online platforms. We automate the process of labeling peer-counselor responses to MI techniques by fine-tuning large domain-specific language models and then use these automated measures to investigate the behavior of the peer counselors via correlational studies. Specifically, we study the impact of MI techniques on the conversation ratings to investigate the techniques that predict clients' satisfaction with their counseling sessions. When counselors use techniques such as reflection and affirmation, clients are more satisfied. Examining volunteer counselors' change in usage of techniques suggest that counselors learn to use more introduction and open questions as they gain experience. This work provides a deeper understanding of the use of motivational interviewing techniques on peer-to-peer counselor platforms and sheds light on how to build better training programs for volunteer counselors on online platforms.

translated by 谷歌翻译

Efficient Malware Analysis Using Metric Embeddings

Ethan M. Rudd , David Krisiloff , Scott Coull , Daniel Olszewski , Edward Raff , James Holt

分类：机器学习 | 人工智能

2022-12-05

In this paper, we explore the use of metric learning to embed Windows PE files in a low-dimensional vector space for downstream use in a variety of applications, including malware detection, family classification, and malware attribute tagging. Specifically, we enrich labeling on malicious and benign PE files using computationally expensive, disassembly-based malicious capabilities. Using these capabilities, we derive several different types of metric embeddings utilizing an embedding neural network trained via contrastive loss, Spearman rank correlation, and combinations thereof. We then examine performance on a variety of transfer tasks performed on the EMBER and SOREL datasets, demonstrating that for several tasks, low-dimensional, computationally efficient metric embeddings maintain performance with little decay, which offers the potential to quickly retrain for a variety of transfer tasks at significantly reduced storage overhead. We conclude with an examination of practical considerations for the use of our proposed embedding approach, such as robustness to adversarial evasion and introduction of task-specific auxiliary objectives to improve performance on mission critical tasks.

translated by 谷歌翻译

Deep learning at the edge enables real-time streaming ptychographic imaging

Anakha V Babu , Tao Zhou , Saugat Kandel , Tekin Bicer , Zhengchun Liu , William Judge , Daniel J. Ching , Yi Jiang , Sinisa Veseli , Steven Henke

分类：机器学习

2022-09-20

相干显微镜技术提供了跨科学和技术领域的材料的无与伦比的多尺度视图，从结构材料到量子设备，从综合电路到生物细胞。在构造更明亮的来源和高速探测器的驱动下，连贯的X射线显微镜方法（如Ptychography）有望彻底改变纳米级材料的特征。但是，相关的数据和计算需求显着增加意味着，常规方法不再足以从高速相干成像实验实时恢复样品图像。在这里，我们演示了一个工作流程，该工作流利用边缘的人工智能和高性能计算，以实现直接从检测器直接从检测器流出的X射线ptychography数据实时反演。拟议的AI支持的工作流程消除了传统的Ptychography施加的采样约束，从而使用比传统方法所需的数据较少的数据级允许低剂量成像。

translated by 谷歌翻译

Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations

Mohammad Mahmudul Alam , Edward Raff , Tim Oates , James Holt

分类：机器学习 | 计算机视觉 | (统计)机器学习

2022-06-13

由于对神经网络的运行推断的计算成本，因此通常需要在第三方的计算环境或硬件上部署推论步骤。如果第三方不完全信任，则需要混淆输入和输出的性质，以便第三方无法轻易确定正在执行哪些特定任务。事实证明，存在利用不受信任的政党的协议，但在实践中运行的计算要求太高了。相反，我们探索了一种不同的快速启发式安全策略，我们称之为连接主义符号伪造秘密。通过利用全息降低表示（HRR），我们创建了一个具有伪加密风格的防御的神经网络，从经验上表现出强大的攻击性，即使在不切实际地偏爱对手的威胁模型下也是如此。

translated by 谷歌翻译

Neural Laplace: Learning diverse classes of differential equations in the Laplace domain

Samuel Holt , Zhaozhi Qian , Mihaela van der Schaar

分类：机器学习 | 人工智能 | (统计)机器学习

2022-06-10

神经普通微分方程模型的动态系统，\ textit {ode}由神经网络学习。但是，ODE从根本上是不足以建模具有长期依赖性或不连续性的系统，这些系统在工程和生物系统中很常见。已经提出了更广泛的微分方程（DE）类作为补救措施，包括延迟微分方程和整数差异方程。此外，当通过分段强迫函数对硬质量和odes进行建模时，神经颂歌会遭受数值的不稳定性。在这项工作中，我们提出了\ textit {neural laplace}，这是一个学习不同类别的统一框架，包括上述所有类别。我们没有在时间域中对动态进行建模，而是在拉普拉斯域中对其进行建模，在拉普拉斯域中，可以将历史依赖性和时间的不连续性表示为复杂指数的求和。为了提高学习效率，我们使用Riemann Sphere的几何立体图来诱导Laplace域中的平滑度。在实验中，神经拉普拉斯在建模和推断DES类别的轨迹方面表现出卓越的性能，包括具有复杂历史依赖性和突然变化的DES类别。

translated by 谷歌翻译

Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection

Michael D. Wong , Edward Raff , James Holt , Ravi Netravali

分类：机器学习

2022-06-07

由于技术困难以与原始数据一致的方式更改数据，因此在网络安全域中，数据扩展很少见。鉴于获得符合版权限制的良性和恶意培训数据的独特困难，这一缺陷尤其繁重，而银行和政府等机构会收到有针对性的恶意软件，这些恶意软件永远不会大量存在。我们介绍Marvolo是一种二进制突变器，该突变器以编程方式生产恶意软件（和良性）数据集，以提高ML驱动的恶意软件探测器的准确性。 Marvolo采用语义保护代码转换，模仿恶意软件作者和防御性良性开发人员通常在实践中进行的更改，从而使我们能够生成有意义的增强数据。至关重要的是，语义传播的转换也使Marvolo能够安全地将标签从原始生成的数据样本传播到，而无需规定昂贵的二进制文件的昂贵反向工程。此外，Marvolo通过最大化给定时间（或资源）预算中生成的各种数据样本的密度来最大化，使从业人员最大程度地嵌入了几种关键优化。使用广泛的商业恶意软件数据集和最近的ML驱动的恶意软件探测器进行的实验表明，Marvolo将准确性提高了5％，而仅在潜在的输入二进制文件的一小部分（15％）上运行。

translated by 谷歌翻译

Learning with Holographic Reduced Representations

Ashwinkumar Ganesan , Hang Gao , Sunil Gandhi , Edward Raff , Tim Oates , James Holt , Mark McLean

分类：人工智能 | 机器学习 | 神经与进化计算

2021-09-05

全息减少的表示（HRR）是通过将每个向量与抽象概念相关联，并提供数学操作以操纵向量的方法来执行符号AI的方法，以便操纵向量，就像它们是经典的符号对象一样。这种方法在较旧的象征性AI工作和认知科学之外已经很少使用。我们的目标是重新审视这种方法，以了解它是否可行，以使混合神经象征性的方法能够学习作为深度学习架构的可差分量。由于数值不稳定性，HRRS今天在可分辨率的解决方案中无效，我们通过引入迫使向量存在于空间良好的点中的投影步骤来解决问题。这样做，我们将HRRS的概念检索效果提高超过100美元。使用多标签分类，我们演示了如何利用符号HRR属性来开发能够有效学习的输出层和损耗功能，并允许我们调查HRR神经象征性学习方法的一些优缺点。我们的代码可以在https://github.com/neuromorphiccomputationResearchProgram/learning-with-hotographicuredued-representations

translated by 谷歌翻译

Multistep Electric Vehicle Charging Station Occupancy Prediction using Hybrid LSTM Neural Networks

Tai-Yu Ma , Sébastien Faye

分类：机器学习 | 神经与进化计算

2021-06-09

公共收费站占用预测在开发智能充电策略方面发挥了重要意义，以减少电动车辆（EV）操作员和用户不便。然而，现有研究主要基于具有有限的准确度的传统经济学或时间序列方法。我们提出了一种新的混合长期内记忆神经网络，其包括历史充电状态序列和时间相关的特征，用于多步离散充电占用状态预测。与现有的LSTM网络不同，所提出的模型将不同类型的特征分开，并用混合神经网络架构处理它们。该模型与许多最先进的机器学习和深度学习方法进行了比较，基于从英国邓迪市的开放数据门户网站获得的EV充电数据。结果表明，该方法分别产生非常准确的预测（99.99％和81.87％，分别前进（10分钟）和6个步骤（1小时），优于基准接近的（+ 22.4％）前方预测和6步前方的预测和6.2％）。进行灵敏度分析，以评估模型参数对预测精度的影响。

translated by 谷歌翻译

API design for machine learning software: experiences from the scikit-learn project

Lars Buitinck , Gilles Louppe , Mathieu Blondel , Fabian Pedregosa , Andreas Mueller , Olivier Grisel , Vlad Niculae , Peter Prettenhofer , Alexandre Gramfort , Jaques Grobler

分类：

2013-09-01

scikit-learn is an increasingly popular machine learning library. Written in Python, it is designed to be simple and efficient, accessible to non-experts, and reusable in various contexts. In this paper, we present and discuss our design choices for the application programming interface (API) of the project. In particular, we describe the simple and elegant interface shared by all learning and processing units in the library and then discuss its advantages in terms of composition and reusability. The paper also comments on implementation details specific to the Python ecosystem and analyzes obstacles faced by users and developers of the library.

translated by 谷歌翻译